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Abstract 

We present a proof for a conjecture previously formulated by Dzhafarov, Kujala, and 
Larsson (Foundations of Physics, in press, arXiv:1411.2244). The conjecture specifies a 
measure for the degree of contextuality and a criterion (necessary and sufficient condition) 
for contextuality in a broad class of quantum systems. This class includes Leggett-Garg, 
EPR/Bell, and Klyachko-Can-Binicioglu-Shumovsky type systems as special cases. In a 
system of this class certain physical properties qi,... ,qn are measured in pairs {qi, qj); every 
property enters in precisely two such pairs; and each measurement outcome is a binary random 
variable. Denoting the measurement outcomes for a property qi in the two pairs it enters by 
Vi and Wi, the pair of measurement outcomes for (qi, qj) is {Vi, Wj). Contextuality is defined 
as follows: one computes the minimal possible value Aq for the sum of Pr [Vi 7 ^ Wi] (over 
i = 1,... ,n) that is allowed by the individual distributions of Vi and Wi; one computes the 
minimal possible value Amin for the sum of Pr [Vi 7^ Wi] across all possible couplings of (i.e., 
joint distributions imposed on) the entire set of random variables Vi,Wi,... ,Vn,Wn in the 
system; and the system is considered contextual if Amin > Aq (otherwise Amin = Aq). This 
definition has its justification in the general approach dubbed Contextuality-by-Default, and it 
allows for measurement errors and signaling among the measured properties. The conjecture 
proved in this paper specifies the value of Amin — Aq in terms of the distributions of the 
measurement outcomes {Vi,Wj). 

Keywords: CHSH inequalities; contextuality; criterion for contextuality; Klyachko-Can- 
Binicioglu-Shumvosky inequalities; Leggett-Garg inequalities; measurement bias; measurement 
errors; probabilistic couplings; signaling. 


1 Introduction 

According to Acfn et al. [T], with only few exceptions, literature on contextuality mostly concerns 
particular examples and lacks general theory. Perhaps adding to the list of exceptions, two recent 
papers nia present a theory of contextuality that, although not entirely general, applies to a 
very broad class of quantum systems. Defining context as the set of physical properties that are 
measured conjointly, the novelty of this theory is in that it applies non-trivially also in the presence 
of context-dependent measurement biases (e.g., due to interactions/signaling, or imperfections in 
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the measurement procedure). When such context-dependent measurement biases are present, the 
distribution of the measurement of a given physical property may vary over different contexts. A 
system is considered noncontextual if there exists a joint distribution of the measurements of all 
contexts such that the measurements of a given physical property over different contexts are (in a 
well-defined sense) maximally correlated. 

This definition, formulated in Refs. ma, applies to all systems where each measurement has 
a finite number of possible outcomes. Analytic and computational results, however, are conhned 
to the subclass of so-called cyclic systems, where each physical property appears in two different 
contexts, each context consists of two different physical properties, and each measurement has 
two possible outcomes. This class includes Leggett-Garg, EPR/Bell, and Klyachko-Can-Binicioglu- 
Shumovsky type systems as special cases. As the main result of Ref. [3], a criterion (necessary 
and sufficient condition) was derived for a system to be contextual given the joint distributions 
of the measurements in each context. In Ref. [2] a measure of the degree of contextuality was 
defined based on how far the measurements of each physical property over different contexts are 
from being maximally correlated. This measure has a theoretically justified formulation (see below 
for details) and it can be used to define a criterion of contextuality: a system is contextual if 
and only if the degree of contextuality is positive. Computer-assisted calculations were used in 
Ref. [2] to derive an expression for the measure of contextuality for systems of 3, 4, or 5 physical 
properties. Based on these, a general expression was conjectured for any number n > 2 of physical 
properties. Interestingly, the criterion of contextuality implied by the expression conjectured in 
Ref. [2] was somewhat simpler and inherently different in form from the one derived analytically 
in Ref. [3], although both must be equivalent if the conjecture is true. In this paper, we show that 
the conjecture is indeed true, and thereby make the results complete for the special class of cyclic 
systems. 

1.1 Terminology and notation 

A cyclic (single cycle) system (see Figure 1) is defined as a system of measured properties qi,..., 

(n > 2) and measurement results (random variables) satisfying the following conditions [2l|3]: 

1. the properties are measured in pairs (gi, 92 ), • ■ • > (Qn-i, <ln), {<ln, 9i), called contexts, so that 
each property enters in precisely two contexts; 

2 . the result of measuring (g'i,( 7 i©i), i = l,...,n, is a pair of jointly distributed ±1 random 
variables (fy, Ify© 1 ), called a bunch (with © denoting circular addition: i Q) 1 = i +1 ioi i < n, 
and n © 1 = 1). 

The Leggett-Garg system [4l[5], the EPR/Bell system [6UI0]. and Klyachko-Gan-Binicioglu- 
Shumovsky system [11] are cyclic systems with n equal, respectively, 3, 4, and 5. See |2j[3] for 
details. 

The distribution of every bunch (V),Wj 0 i) is uniquely determined by the expectations (fy), 
(Wi 0 i), and (Vilfy©i). A pair {Vi,Wi} of random variables, representing the same property qi 
in different contexts, is called a connection: Vi,Wi are not jointly distributed, so the expectation 
{ViWi) is undefined. 
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Figure 1: A schematic representation of a cyclic system. For i = 1,... ,n, the measurement of qi is 
denoted Vi if qi is measured together with 9 ^ 01 ; and the measurement of qi is denoted Wi if qi is 
measured together with qtQi- As a result, the observed pairs of random variables (“bunches”) are 
(^i,VF 2 ), {V2,W3), (K,hFi). 

1.2 Contextuality 

A (probabilistic) coupling for random variables X,Y, ... is defined as any random variable 
{X* ,Y*, Z*,...) such that X* ,Y*, Z*,... are distributed as, respectively X, y,Z,.... By defi¬ 
nition of a random variable (in the broad sense of the term, including vectors and processes), the 
components of {X*, Y*, Z*,...) are jointly distributed. For simplicity, we omit asterisks and speak 
of a coupling (X, Y, Z^...) for X, F, Z,... (imposing thereby, non-uniquely, a joint distribution on 
X,Y, Z,... that otherwise may not have one). 

In relation to cyclic systems, we are interested in two types of couplings: (1) couplings (Vj, Wi) 
for the connections {Vi,Wi] {i = l,...,n); and ( 2 ) couplings ((Vi, W 2 ), (V 2 , W 3 ),..., (I 4 , IFi)) 
for the set of the observed bunches (Vi, W 2 ), (V 2 , W 3 ),..., {Vn,Wi). The latter coupling can be 
written as a random variable (Vi, W 2 , V 2 , IF 3 ,..., 14 , IFi), with the proviso that its 2 -marginals 
{Vi,Wi^i) are distributed as the corresponding bunches. Put differently, it is the coupling for the 
entire system of random variables that agrees with the observed bunches. 

For every f = 1,..., n, among all couplings for the connection {Vi, Wi\, we consider one in which 
{ViWi) = 1 — \{Vi) — (Wi)|. This coupling is called maximal, because 1 — |(Pi) — {Wi)\ is the maxi¬ 
mum possible value for fViWi) (with given fVi) and {Wi)). Equivalently, in the maximal coupling 
for {Vi,Wi} the probability Pr ^ Wi\ attains its minimum possible value: \ |(I4) — (fAi)|. It is 
clear that all connections have maximal couplings if and only if 

n 

A = ^Pr[P,4IT,] (I) 

2=1 
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is at its minimal possible value. We denote this value 

1 ” 

^0 = ( 2 ) 

^ i=l 

Clearly, in every coupling (14, ^ 1 ^ 2 , V 2 , W 3 ,... ,Vn,Wi) for the entire system the value of A is 
uniquely defined and cannot fall below Aq. This leads to the following definition of a measure 
(degree) of contextuality: it is 

CNTX = A,„i„ - Ao > 0, (3) 

where Amin is the smallest possible value of A across all couplings for the entire system. The system 
is contextual if CNTX > 0, and it is not if CNTX = 0. In Ref. [3], if a system is not contextual, it 
is said to have a maximally noncontextual description. 


1.3 Conjecture 

In Ref. [2], it was conjectured that 

1 Jsi ((RjWiei) : i = 1,... ,n) - (u - 2), 

Amin = — max < 

where the function Si (xi,... ,Xk) for any k > 1 real-valued arguments is defined as 


(4) 


k 

Si (xi,... = max^m^x^, (5) 

i^l 

with the maximum taken over all mi ... ,mk C { — 1 , 1 } such that OiLi = “ 1 - ^ follows then 
that the measure of contextuality is 

CNTX = 1 max - = '. ■ ■ ■ >") " K''-) " - (~ - 2), 

2 \ 0 , 

and a system is contextual if and only if 

n 

SimWii^i) ■.i = l,...,n)>Y,\{^)-{W^)\-{n-2). (7) 

i=l 

In Ref. [5] this was shown to be true for n = 3,4, 5, but not generally. 

In Ref. [3], a different criterion for contextuality in cyclic systems was derived: a system is 
contextual if and only if 


Si mW,Qi) ,l-\{Vi)-{W,)\:i = l,...,n)>2n-2. (8) 

Of the two criteria, the conjectured ([7]) and the proved ([ 8 |), the former is more specific, as it is 
easy to see that ([ 8 |) follows from it (which means that it is known to be a necessary condition for 
contextuality). However, this is not the main reason why one should be interested in ([7]). The 
main reason is that © follows from a conjectured formula for the fundamental theoretical quantity 
Amin whose excess (jS]) over the minimum possible value Ag is used as a measure of contextuality, 
whereas the degree of violation of (|S]) does not have any theoretically motivated interpretation as a 
measure of contextuality. 
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1.4 What we do in this paper 

It is easy to see that ([8]) and Q are equivalent for consistently-connected systems, i.e., those with 
Aq = 0. Due to the results obtained in Ref. [2], the two criteria should also be equivalent for 
n = 3,4, 5. However, it is easy to show by examples that the two inequalities, © and ([8]), are not 
algebraic variants of each other: in particular, the expressions 

n 

Si : i = 1,..., n) - ^ KR,) - (ID,) + n\ 

and 

are not equal to each other. 

Nevertheless, the two inequalities are equivalent, as we prove in Section 151 (Theorem [T51) . That 
is, ([5]) is indeed a criterion as conjectured in Ref. [5]. In Section 0] we prove the general formula © 
for the contextuality measure (Theorem [Tl]). Note that the criterion ([7]) is merely a consequence of 
the contextuality measure formula ([HI), i.e., the logical derivability diagram is 

Definition of Contextuality ([31) 


Contextuality Measure Formula (l6|) 


Criterion of Contextuality 0 

However, in this paper we arrive at the formulas for the measure and criterion in a more circuitous 
way. We first prove that the criterion ([7]) is equivalent to the previously derived criterion ([8]), and 
then we use © and the definition of contextuality to derive the measure formula ([6)) : 


Definition of Contextuality ([3|) 


Ref. [2] 


neorem 


M 


Contextuality Measure Formula 


alH 


Criterion of Contextuality ([8|) 

Theorem [HI 

Criterion of Contextuality © 


2 Results we need for the proofs 

Here we list some results from Ref. [3]. We make use of the function Si defined in m and the 
function 

k 

So (xi,... ,Xfc) = maxy^m^x^, (9) 

i^l 

with fc > 1 reaf-vafued arguments, where the maximum is taken over all mi ..., mu G {~1 ,1} such 
that m* = 1. 
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Lemma 1. For any ai,..., Un, bi,... ,bm S K, 

si(ai, ...,an,bi,...,bm) = max{ So(ai,... ,a„) + Si(5i,... ,6^), 

si(ai, ...,a„) + so( 6 i,..., 6 ^) } 


and 


So(ai , . . . , Q-m 5 ■ ■ • ; ) = max{ So(ai,..., a„) + So( 6 i ,..., 6771 ,), 

(^1 j ■ • ■ 7 ^n) Si (61, . . . , bjYi'j 


Lemma 2. Jointly distributed ±l-valued random variables A and B with given expectations 
{A), (B), (AB) exist if and only if 


[-1 < M) < 1, 

< -1 < (B) < 1, 

[\{A) + {B)\-1<{AB)<1-\{A)-{B)\. 

Lemma 3. Given jointly distributed random variables (^4, B) and jointly distributed random vari¬ 
ables {B',C), there exists a coupling {A,B,B',C) such that B = B' if and only if B and B' have 
the same distribution. 

Lemma 4. Jointly distributed ±1 random variables Ai,..., An (n > 2) with given expectations 
(Ai),..., (An),{A iA 2 ) ,..., {An-iAn) cxist if and only if A = At and B = satisfy the condi¬ 
tion of Lemma\^for all i = 1,..., n — 1. 

Theorem 5. Jointly distributed H-valued random variables Ai,..., An (n>3) with given expec¬ 
tations 

(^ 1 ) ) • ■ • 5 7 (^ 1 ^ 2 ) 5 ■ • ■ 5 {An — \Anj , i^AnAf) 

exist if and only if A = Ai and B = ^4^01 satisfy the condition of Lemma\^for all i = 1,..., n and 

Si (( 741742 ) , . . . , {An-lAn) , {AnAi)) <77,-2. 

The main result of Ref. |31 is the following theorem. 

Theorem 6. For each i = 1,... ,n (n > 2), let the distribution of a pair (Vi, Wiei) of ±l-valued 
random variables be given. Then, the following two statements are eguivalent: 

1. there exists a joint distribution (coupling) of the pairs such that for all i = l,...,n the 
probability 

Pr [Pi ^ ITi] 

in the joint is the minimum possible allowed by the marginal distributions of Vi and Wi (i.e., 
there exists a maximally noncontextual description), 

2. the main criterion 


Si((VW^Qx),l-\{V)-{Wf)\:i = l,...,n)<2n-2 


holds true. 
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3 The equivalence result 

In Ref. [5], the inequality 


Si ((R,IRz®i) :i=l,...,n)<n-2 + Y,\{V,)-{W,)\ 

i=l 

was derived by computer-assisted calculations as a criterion for the existence of a maximally non- 
contextual description for n = 3, n = 4, and n = 5, and it was conjectured that the same pattern 
would hold for all n > 2. In this section, we prove (in Theorem [T3] below) this conjecture by showing 
that the inequality shown above is equivalent to the main criterion of Theorem [S] 

Lemma 7. For any numbers oi,..., a„ S K (n > 2), exactly one of the following conditions hold: 

1. for some index k, the inequality ai > |afe| holds for all i ^ k. 

2. for some distinct indices j and k, the inequality aj -|- < 0 holds. 

Lemma 8 . Suppose that for some index k the inequalities Oi > Ok and > 0 hold for all i ^ k 
(this is implied in particular by condition 1 of Lemma\^. Then, 

Si (ui an) — ^ ( ai u/c- 
i^k 

If condition 1 of Lemma\^ holds^ then 

Sq (^1 ; • ■ ■ ) ^n) — ^ ^ ■ 

i 

Lemma 9. For each z = (n > 2), let (Vj,Wi 0 i) be a pair of jointly distributed random 

variables. For any combination of signs mi,..., m„ S {“Ij +1}; there exist another set of random 
variables 


R : = m^Vi, 

Wi : = m^Wi 

(with each pair (Ri,ITi©i) jointly distributed) preserving the values of Sq and Si of the product 
expectations: 

50 ((RWiei) : z = 1,... ,n) = so((RITi®i) : z = 1,... ,n), 

51 ((RWi©i) : z = 1,... ,n) = Si((RIT,ei) : z = 1,..., rz), 

and preserving the upper and lower bounds 

\m + {m)\-1 = \{v^) + {w^)\-1 

of the hypothetical connection expectation (ViWi) for all i = This implies in particular 

that the conditions m and m as well as the measure f26V to be defined later are all insensitive 
to negations of the physical properties. 
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Lemma 10 . One can always find such a configuration of signs mi,..., m„ G {~1) +1} ™ Lemma\^ 
that i= 1,... ,n, satisfy condition 1 of Lemma\^ and since the conditions ilS\) and m 

as well as the measure are all symmetrical w.r.t. permutations of the indices, one can generally 
assume that condition 1 of Lemma\^ is satisfied with k = n. 

Proof. Let k be the index that minimizes |(ViWi 0 i)| and let := 1. Then, define recursively 

_J + 1 ) > 0 , 

otherwise, 

for all i = fc © 1, /c © 2,..., fc © (n — 1). This implies {ViWic^i) = (VibLiei) > 0 for all 

i k and so (VifLiei) = l(^W/i©i)l > K^fcW^fe©fe)| for all i k and condition 1 of Lemma [7] is 
satisfied. □ 

Lemma 11. For any a, b,c,d G K, we have 

— |d + c| + |a — c| + |d — 5| — |a — 6 | <2 max{| 6 | ,\d\} , 

— \a — b\ — \d — c\ + \ \a — c\ — \d — b\\ < 0 . 

Proof. Using the triangle inequality \a — c\ < \a — b\ + \b — (—d)| + \{—d) — c|, we obtain 
— \d + c\ + \a — c\ + \d — b\ — \a — b\ 

< -|d + c| + (|a -b\ + \b- (-fi)l + \{-d) — c|) + \d - b\ - \a - b\ 

= \b + d\ + \d — b\ = 2 max{| 6 |, |(i|} , 

which is the first inequality. The latter inequality follows as the conjunction of the following two 
triangle inequalities 


|a — c| < |a — 5| + Id — c| + |d — 5|, 
\d — b\ < |a — 6 | + |d — c| + |a — c|. 


□ 

Lemma 12. For all i = l,...,n (n > 2), let {Vi,Wi) be a pair of jointly distributed ±l-valued 
random variables satisfying (ViWi) = 1 — |(Ui) — {Wi)\ and let pi,..., pn satisfy 


\{Vf) + {mei)| - 1 < P* < 1 - KU) - {VL,®i)| (10) 

for i = 1,... ,n. If there exist distinct indices j and k such that 

{l-\{V,)-{W,)\) + {l-\{Vk)-{Wk)\)<0, (11) 

then, there exists a joint distribution of the pairs {yi,Wi), i = l,...,n, satisfying {ViWii^i) = pi 
fori = l,...,n. 

Proof. We prove the statement by induction. For n = 2, denoting (VjWi®!) = pi and {ViWi) = 
1 — \{Vi) — {Wi)\ for 1 = 1,2, Theorem [5] implies that a joint exists if 


Si {{ViWf), {V2W1 ), {ViWi ), {V2W2)) < 2 , 


where, without loss of generality (by Lemmas and ITUll. we assume {V 1 W 2 ) > |(V 2 W'i)|, and 
condition (fTTt yields (FiVLi) + {V 2 W 2 ) < 0. Thus, we have by Lemma [T] 


Si ((V1IT2) , (F2IT1) , (FilTi) , {V2W2)) 

fsi {{V1W2 ), {V2W1)) + So {{ViWi ), {V2W2)) 

- II18;X \ 

\so {{V1W2 ), {V2W1)) + Si {{ViWi ), {V2W2)) 

I {V1W2) - {V2W1) - {ViWi) - {V2W2) 

- max ^ ^ l(FiVLi) - {V2W2)\ 

r(l-|(yi)-(lT2)|)-(|(y2) + (VLi)|-l) 

< I -(l-|(Fi)-(LLi)|)-(l-|(y 2 )-(VL 2 )|) 

(l-|(yi)-(lT 2 )|) + (l-|(F 2 )-W)|) 

I + 1(1 - l(l"i) - (W^i)l) - (1 - 1(1^2) - (VL2)|)| 


- 1(1^2) + (ILi)l + |(lh) - (ILi)l + 1(1^2) - (11^2)1 - KVd) - ( 1 ^ 2)1 
2 _ |(Fi) _ {W 2 )\- m) - (ITi)l + IKVi) - (ITi)l - 1(1/2) - (VL 2 )|| 


< max 


2 niax{|(lT 2 )|,|(y 2 )|} 

2 


< 2 , 


where the two inequalities follow from respectively Lemma [2] and Lemma ITT] Thus, the statement 
holds for n = 2. 

Assuming then that the statement holds for all systems smaller than n (n > 3), we prove it for 
a system of size n. With no loss of generality, assume j = 1 < k < n. By Lemma 01 there exists 
a joint for the chain {Vk,Wk+i, ■ ■ ■, 14 ,14i) satisfying {ViWi^i) = pi for * = fc,..., n and this joint 
has a certain marginal of (14, ITi) satisfying by Lemma [2] the range 


1(14) + (M"i)| - 1 < {VkW^) < 1 - 1(14) - (ILi)l. 


By the induction assumption, there exists a joint of (14, IT 2 ,..., 14, ILi) satisfying (14114ei) = Pi 
for z = 1,..., fc — 1 and matching the marginal of (14,114) mentioned above. By Lemma [31 it follows 
that there exists a joint of all (14,114) satisfying (14114ei) = Pi for z = 1,..., n. □ 

Theorem 13. For each z = 1,..., n (n >2), let the distribution of a pair (I 4 ,1140i) of ±l-valued 
random variables be given. Then, the main criterion 


si((yzlT,©i),l-1(14)-(114)1 :* = l,...,n) <2n-2 (12) 

is equivalent to 

n 

Si((141T,©i) :z = l,...,n) <n-2 + ^|(14)-(lT,)|. (13) 

i=l 

Proof. First, without loss of generality (by Lemmas [9l and [T0)l . we assume |(14114)| < (141140i), 
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i = 1,... ,n — 1. By Lemma [51 this condition implies 

n 

somW,^,):i = l,...,n) = (14) 

n — 1 

■.i = l,...,n) = ^ ('L.W',®!) - (14B^i). (15) 

i=l 

By Lemma [1] the criterion (1121) is equivalent to the conjunction of 

Si i = 1,..., n) + So (1 - ICL,) - (VL,)I : z = 1,..., n) < 2n - 2, (16) 

So ((V^.W^.ei) : z = 1,..., n) + Si (1 - |(y,) - (B^,)I : i = 1,..., n) < 2n - 2. (17) 

Case 1. Suppose now that that the terms 1 — |(Vj) — (Wi)| satisfy for some k the condition 
1 — 1(14) — (114)1 > |1 — 1(14) — (Ihfe)ll for all i ^ k. Then, Lemma [5] yields 

n 

So{l-m)-{W,)\■.i = l,...,n) = n-Y,m)-{W^)\, (18) 

Si (1 - |(B,) - (1T,)| : z = 1,... ,n) = u - 2 - ^ 1(14) - (114)| + |(14) - {Wk )\• (19) 

i^k 


Now (fT51) implies that (ITC)) is equivalent to (fT51) . Furthermore, using m s>iicl (|14p . tliG left side of 
El) becomes 

n 

^ (141Li©i) + rz - 2 - ^ |(B,) - (1F,)| + |( 14 ) - (lFfc)| 

i—\ i^k 

n 

<2n-2-'£m- (lF,®i)| - ^ 1(14) - (114)1 + 1(14) - {Wk)\ 

i—1 i^k 

which, by the triangle inequality 

n 

1(14) - (11^01 < E + E (20) 

2=1 i^k 

implies dm) (i.e., dm always holds in Case 1). It follows that the two conditions for maximal 
noncontextuality are equivalent under the assumption of this case. 

Case 2. Suppose then that the assumption of Case 1 does not hold. Then, by Lemma 0 the 
condition ([111) of Lemma Upholds and so Lemma [T51 implies that a joint exists, which, by TheoremjS] 
implies that dm holds. However, the condition dm also yields \ {Vj) - {Wj) \ + |(14) - (114)| > 2 
which implies that that the right side of (fT5|) is at least n whereas the left side cannot exceed n and 
so dm holds true as well. Thus, the two conditions for the existence of a maximally noncontextual 
description are equivalent under this case as well. □ 


4 Proof of the conjecture on the measure of contextuality 

In this section, we prove the following theorem, which was shown to be correct for the special cases 
of rz = 3, n = 4, and n = 5 and conjectured to hold for all rz > 2 in Ref. [5]: 


10 


Theorem 14. For each i = 1,... ,n (n >2), let the distribution of a pair (Vi, Wi^i) of ±l-valued 
random variables be given. Then, the minimum possible value of 

n 

A = ^ Pr [F, ^ W,] 

2=1 


over all possible joints of the given pairs is 

1 Jsi ((P,Wi©i) : i = 1,.. .,n) - (n - 2), 

Remark 15. Here the bottom expression for Amin corresponds to the case that a maximally noncon- 
textual description exists, i.e., that all probabilities Pr [Vi ^ Wi], i = 1,... ,n, are at their minimum 
values allowed by the marginals. In Ref. [2], the excess of Amin over its minimum possible value 
i X]r=i K^) “ (^i)l given the marginals is defined as a measure of contextuality. 

To prove Theorem 1141 we will first rewrite it in a more accessible form for our proof. Noting 
that for a pair of ±l-valued random variables, the probability Pr [Vi ^ Wi] is fully determined by 
the expectation (RWi) through the identity 


(VWf) = l-2W[V,^Wi\, (21) 

we see that minimizing the probability Pr [R ^ Wi] is equivalent to maximizing the connection 
expectation (RWi). Hence, a description is maximally noncontextual if and only if the connection 
expectations satisfy 

(VWf) = l-[(V)-(Wf)[, i = l,...,n, 

that is, all the connection expectations are at their maximum values as given by Lemma [51 Using 
m, we can rewrite Theorem [TJ] in the following equivalent form: 

Theorem 16. For each i = 1, ... ,n (n >2), let the distribution of a pair (Vi, Wi©i) of ±l-valued 
random variables be given. The maximum possible value of 

n 

S = Yi {V,Wi) 

2=1 


over all possible joints of the given pairs is 

. j2n-2-Si((VWiisi) :i=l,...,n). 

The proof needs several lemmas: 

Lemma 17. Suppose oi,..., a„ G [—1,1]. Then, So(ai,..., a„) + Si(ai,..., a„) <2n — 2. 

Lemma 18. Jointly distributed ±l-valued random variables Hi,..., An (n> 2) with expectations 
(Hi),..., (An ), (H 1 H 2 ),..., (H„_iH„), (H„Hi) exist if and only if A = A,, and B = H^©! satisfy 
the condition of Lemma\^for all i = 1,... ,n and 

So ((^ 1 ^ 2 ) ) • ■ • ) (-^n-l-t^n)) — (n — 2) < (H„Hi) < (n — 2) — Si ((H 1 H 2 ) , . . . , (An-lAn)) . 
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Furthermore, the range given by the above inegualities is always nonempty and intersects the range 

K^n) + (^l)l ~ 1 £ (^n^l) < 1 ~ K^n) ~ (^l)l 


given by Lemma\^ 

Proof. The first part is a direct corollary of Theorem [5] and Lemma [T] The fact that the two 
ranges intersect follows from the fact that a joint always exists for the chain with expectations 
{AiAf ), ■ ■ ■, {An-iAn) and this joint yields some 2-marginal for {An,Ai) which satisfies Lemma[5J 

□ 

Lemma 19. For each i = 1,... ,n (n > 2), let the distribution of a pair {Vi,Wii^i) of Fl-valued 
random variables be given. Suppose that the main criterion m does not hold and formally define 
the expressions 


S:=Y, , 

M:=2n-2-si((y.lT,©i) :i = l,...,n). 

If there exists some values {{VAVi) : z = 1,... ,n} (called connection vector) satisfying the condition 
of Lemma\^for each i = 1,... ,n, the condition 

{VkWu) = l-\{Vu)-{Wu)\, 

{ViWf)>\l-\{Vk)-{Wk)\\,i^k 

for some index k, and the ineguality S < M, then there exist another connection vector 
{{ViWi) : i = 1,... ,n} such that a joint of the given observable pairs having these connection ex¬ 
pectations exists and satisfies the eguation S = M. 

Proof. Consider the set of all connection vectors {{ViWi) : i = 1,... ,n} satisfying the condition of 
Lemma[5]for each pair (fy, Wi), i = 1,... ,n, and the condition (12^ . Within this set we have the as¬ 
sumed configuration satisfying S < M and the configuration (ViWi) = 1 —|(fy) — {Wi)\, i = 1,... ,n, 
which yields S > M due to (IT^ not being satisfied. Since S' is a continuous function of the vector 
{(ViWi) : z = 1, ..., n} and the conjunction of the condition of Lemma[2]and condition (j22p defines 
a connected set of connection vectors, there exist a configuration {(ViWf) : z = 1 ,..., rz} satisfying 
the condition of Lemma [5] and with equality S = M. For this conhguration, expanding the 
definitions of S and M we obtain 


Si ((fylF,©i) : z = 1,..., zz) + ^ {V,W0 = 2n-2. 

Also, since ([55]) implies condition 1 of Lemma [7] we obtain from Lemma [5] 

n 

Si ((fyw,) : Z = 1,... ,rz) = ^ (fyW,) - (1 - \{Vu) - {Wk)\). 

i^k 


(23) 


(24) 

(25) 
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Now (1^ and (IMl) imply 


s^{{V,W^Q^)■.i=l,...,n)+SomW,)■.i=l,...,n) = 2n-2 


and Lemma m applied to (1^ and (051) yield 

So ((VjWiei) : z = 1 ,... ,n) + Si {{ViWi) : z = 1,... ,n) 

n 

< E (1 -+E (1 -- (1 - 

i—l i^k 

n 

= 2n-2+\{Vu)- {Wu)\ -Y^m)- {W,m)\ - E ' 

i—l i^k 

"-V-" 

<0 (by the triangle inequality |20j) 

Thus, by By Lemma [T] and Theorem [SJ a joint exists. □ 

Now we are ready to prove the main result. It suffices to prove the following partial statement 
(Theorem [16] and hence also Theorem |T4| follow immediately): 

Theorem 20. For each i = 1,... ,n (n>2), let the distribution of a pair (Vi, ITiqi) of ±l-valued 
random variables be given. If the main criterion m does not hold, then the maximum possible 
value of 

n 

s = Y 

i=l 

over all possible joints of the given pairs is given by 

M = 2n - 2 - Si ((ViWi^i) ■. i = I,... ,n) > n - 2. (26) 

Proof A larger value cannot be reached since we have 

2 n — 2 

(TheoremjS]) > Si ((yWi©i), {VWi) :i = l,...,n) 

(Lemma]!]) > Si ((t^iWi®i) : I = 1,..., n) + Sq ((ViWi) :i = l,...,n) 

> Si ({ViWiQ)i) : i = 1,..., n) + S'. 

To show that this value can be reached, without loss of generality (by Lemmas [Hi and [TO assume 

> |(KW"i)|, ^ = l,...,r^-l, (27) 

and since the criterion m is not satisfied, by Lemma m the maximal connection expectations 
1 — \ {Vi) — {Wi)\ satisfy for some k the condition 

l-\{Vi)-{W,)\>\l-\{Vk)-{Wk)\\, I^k. (28) 

We will show that a specific configuration {(ViWi) : i = 1,..., n} exists for which S = M and for 
which a joint of the pairs (Vj, Wn^i), i = 1,... ,n, exists. This is shown separately for four cases 


13 



that exhaust all possible situations under the present assumptions. The four cases are defined as 
follows: 


Vi : |(y,) + {W ^)\- 1 < |1 - K^fc) - {Wk)\\l 



Casel l-\{Vk)-{Wu)\>W 



Case 1. We have for some index j, 

\{Vj) + {W ,)\- 1 > |1 - 1(14) - m)\\. (29) 

In this case, we define {ViWi) = 1 — \{Vi) — {Wi)\ for all i ^ j and choose an arbitrary value 
— K^) + (^t)| ~ 1 from the range given by Lemma[T51 (intersected with the range given by 
Lemma[2|). Since a joint exists by Lemma ITSl for this configuration, we obtain S < M. Furthermore, 
since ([28ll and ([29ll imply that condition (l2^ is satisfied for this configuration. Lemma [19] implies 
that a joint exists for some configuration with S = M. 

Case 2. We have 


l-|(I4)-(bFfc)l <0, 

\{Vj) + {W,)\-l<\l-\{Vk)-{Wk)\\, j^k. (30) 

In this case, we choose an index j ^ k and define (VjWj) = |1 — |(t4) — (bFfc)|| = 
— (1 — 1(14) — (114)I) and (ViWi) = 1 — \{Vi) — {Wi)\ for i ^ j. These values satisfy condition 
(1^^ by ((^51) and the condition of Lemma|i|by (0^ and (|5U1) . Now {VjWj) + (14114) = 0 implying 
that S < n — 2 < M and so by Lemma [TOl a joint exists for a configuration with S = M. 

Case 3. We have 


0 < 1 - 1(14) - (ITfe)l < (KbFi), 

|(l"z) + (lT,)|-l<l-|(14)-(lTfe)|, i = (31) 

In this case, we define (14114) = 1 — 1(14) — (114)1 > 0 for all i = 1,... ,n and condition (l22ll is 
satisfied trivially. These values also satisfy the condition of Lemma|2|by (l2^ and (|31ll . Furthermore, 
by (I27jl and m, all product expectations {(14114 ®i), (ViWi) : i = 1 ,... ,n} are nonnegative and 
the smallest value among them is (14114) = 1 — |(14) — (Hfe)!- So, by LemmajSJ we can expand 

Si ((141T.®i), 1 - 1(14) - (ITfc)l : i = 1,..., n) 

n 

= Y. + (n - 2) (1 - 1(14) - (ITfc)l) 

i=l 

< n + (n — 2) <2n — 2. 
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Thus, a joint exists by Theorem [S] which implies that S < M and so by Lemmajinia joint exists for 
a configuration with S = M. 

Case 4. We have 


{VnWi)<l-\{Vk)-{Wk)\>0, (32) 

m) + (W',)| - 1 < 1 - KFfc) - (Wfe)l, i = 


In this case, we define {ViWi) = 1 — |{I4) — (II4)| > \{Vi) + {Wi) \ — 1 for i 7^ fc and (051) implies 
that these values satisfy the condition of Lemma [2] Thus, a joint exists by Lemma [18] for 


(VfcVLfc) = min 


2u-2-Si({y,W®i) :i = 1,.. 

i-|(14)-TOI- 


n] {ViW^) 


(33) 


Assume first that the top expression in (1551) is the minimum. Since here {VnWi) is by (1571) and 
(1551) the smallest argument to Si (...) in (1551) and all other arguments are nonnegative Lemma [5] 
yields 


Si((y*IT,®i) : i = 1 ,..., n; (F,W,) : i 7 ^ fc) 

n—1 

= Y. + (^ - 1) (1 - \{Vk) - {Wu)\) 

= Si ((F,W,®i) : z = 1,. .. ,n) + (n - 1) (1 - 1(14) - (W"fc)l) 

= Si ((F*W,®i) ■.i = l,...,n) + S- (VkWk). 


Substituting this in (1551) yields 


5 = 2n - 2 - Si ((t/,Wi®i) : i = 1 ,... ,n) = M. 

Suppose then that the bottom expression in (1331) is the minimum. Then, we have 


(14W) = 1 - 1(14) - (ITfc)l > 0, i = 


and condition (1551) is satisfied and since a joint exists, it follows S < M. But then by Lemma ITOl a 
joint exists for a configuration with S = M. □ 


5 Conclusion 

We have derived the formula ([5]) for contextuality measure in cyclic systems with binary variables, 
previously conjectured based on the principles laid out in Refs. [5115]. The measure is based on the 
principled comparison of the minimal values Aq and Amin of 

n 

A = ^Pr [R. 7^M4], 

1=1 

with Aq computed across all couplings for the individual pairs {14, Wi} (i.e., by minimizing each 
Pr [14 7 ^ 114 ] separately), and with Amin computed across all couplings for the entire set of the 
random variables {14, Wi : i = 1,..., n\. 

A logical consequence of this measure is the criterion (necessary and sufficient condition) for 
contextuality ([7]). In this paper, however, we proved ([7]) by showing its equivalence to the criterion 
([5]) derived in Ref. |5], and used this equivalence to derive ([5]). 
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